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Background: Angiogenesis is involved in tumor growth, macular degeneration, 
retinopathy and other diseases. Vascular endothelial growth factor (VEGF) 
stimulates angiogenesis by binding to specific receptors (VEGFRs) on the 
surface of vascular endothelial cells. VEGFRs are receptor tyrosine kinases that, 
like the platelet-derived growth factor receptors (PDGFRs), contain a large 
insert within the kinase domain. 

Results: We report here the generation, kinetic characterization, and 2.4 A 
crystal structure of the catalytic kinase domain of VEGF receptor 2 (VEGFR2). 
This protein construct, which lacks 50 central residues of the 68-residue kinase 
insert domain (KID), has comparable kinase activity to constructs containing the 
entire KID. The crystal structure, determined in an unliganded phosphorylated 
state, reveals an overall fold and catalytic residue positions similar to those 
observed in other tyrosine-kinase structures. The kinase activation loop, 
autophosphoryfated on Y1 059 prior to crystallization, is mostly disordered; 
however, a portion of it occupies a position inhibitory to substrate binding. The 
ends of the KID form a p-like structure, not observed in other known tyrosine 
kinase structures, that packs near to the kinase C terminus. 

Conclusions: The majority of the VEGFR2 KID residues are not necessary for 
kinase activity. The unique structure observed for the ends of the KID may also 
occur in other PDGFR family members and may serve to property orient the KID 
for signal transduction. This VEGFR2 kinase structure provides a target for 
design of selective anti-angiogenic therapeutic agents. 
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Introduction 

Many physiological events including embryogenesis, organ 
development, estrus, and wound healing require vascular 
growth and remodeling [1,2]. In addition to these benefi- 
cial processes, angiogenesis is also involved in the prolifer- 
ation of disease states such as tumor growth, metastasis* 
psoriasis, rheumatoid arthritis, macular degeneration and 
retinopathy [3-7]. Of the signaling pathways known to 
influence vascular formation, that involving vascular endo- 
thelial growth factor (VEGF) has been shown to be essen- 
tial and selective for vascular endothelial cells [8-10]. The 
therapeutic potential of inhibiting the VEGF pathway has 
been directly demonstrated by anti-VEGF monoclonal 
antibodies, which are active against a variety of human 
tumors [11] and ischemic retinal disease [12]. 

VEGF is a homodimeric cytokine that is expressed in at 
least four splice-variant forms of 121-206 residues [10]. 
Vascular endothelial cells express at least two high-affin- 
ity receptors for VEGF: VEGFR1 (also referred to in the 



literature as Fk-1) and VEGFR2 (also referred to as 
KDR). Another related receptor, VEGFR3/FU-4, binds the 
VEGF-like growth factor, VEGF-C, believed to function in 
lymphogenesis [13]. These VEGFRs are receptor tyrosine 
kinases each comprised of an extracellular domain that 
binds VEGF and contains seven immunoglobulin-like 
segments, a short membrane-spanning region, and a cyto- 
solic domain possessing tyrosine-kinase activity. The kinase 
domain directly follows the extracellular and juxtamem- 
brane regions and is itself followed by another domain 
(referred to here as the post-kinase domain), which may 
function in binding of other proteins for signal transduction. 
VEGFR1 and VEGFR2 appear to have different signaling 
pathways and functions, with VEGFR2 being of primary 
importance in mitosis of endothelial cells [14-16]. 

On the basis of sequence homology and overall domain 
structure, VEGFRs belong to the platelet-derived growth 
factor receptor family (PDGFR), which also includes 
PDGFRoc, PDGFRp, the stem cell growth factor receptor 
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(c-kit), and the colony stimulating factor-1 receptor 
(CSF-'lR; also referred to as c-fms) [17]. Compared to 
other protein kinases, members of this family contain an 
insert of approximately 63-97 residues, termed the kinase 
insert domain (KID), within the kinase domain. Deletion 
or mutation of the KID from PDGFa, PDGFp, ckit, and 
CSF-1 R has indicated that this domain is not necessary for 
intrinsic kinase activity but that it is important for the 
binding of other proteins involved in signal transduction 
via autophosphorylation of KID tyrosine residues [18-24]. 
Although the signaling pathways and the specific role of 
the KID are still not fully determined for VEGFRs, the 
VEGFR2 KID does contain two tyrosines that have been 
shown to be autophosphorylation sites [25]. 

Since the determination of the first cyclic-AMP-depcn- 
dent protein kinase (cAPK) structure [26) a variety of 
protein kinase structures have been reported [27]. Among 
the receptor protein tyrosine kinases (RPTKs), structures 
of the kinase domain of the insulin receptor (IRK) [28,29] 
and the fibroblast growth factor receptor 1 (FGFR1) [30,31] 
have been determined. We report here the generation, 
kinetic characterization, and structure determination of a 
modified kinase domain of VEGFR2 containing 18 residues 
of the 68-rcsiduc KID. This 2.4 A crystal structure of the 
phosphorylated VEGFR2 catalytic domain is the first 
reported structure of a kinase domain of the PDGFR 
famiiv This structure provides insights into the orienta- 
tion of the KID domain of VEGFR2, which may be rele- 
vant to other PDGFR family members. Furthermore, as 
inhibition of VEGFR2 kinase has broad clinical potential, 
this structure provides a three-dimensional description of 
the target for structure-based design of small molecule 
VEGFR2 inhibitors as therapeutic agents. 

Results and discussion 
Kinetic analysis 

The tyrosine kinase domain of human VEGFR2 lacking 50 
central residues of the 68 residues of the KID was expressed 
in a baculovirus/insectcell system. Of the 1356 residues of 
full-length VEGFR2, this protein construct (VEGFR2A50) 
contains residues 806-939 and 990-1171 of the cytosohc 
domain (Figure 1). In addition to the 50-res.due ddeuon 
VEGFR2A50 also contains one point mutation <E990\ ) 
within the KID relative to wild-type VEGFR2. 

Purified VEGFR2A50 was found to efficiently catalyze 
its autophosphorylation as well as phosphorylation of a 
- poly(E 4 Y) exogenous substrate. The kinetic methods and 
analysis reported here are very similar to those previously- 
reported in detail for another VEGFR2 kinase doma.n 
protein construct [32). 

Using a spectrophotometry assay kinetic constants were 
determined for both native, nonphosphorylated and auto- 
phosphorylated VEGFR2A50 (VEGFR2A50P) (Table 1). 



Kinetic characterization of the native nonphosphorylated 
protein was possible because, under the low protein corcen 
trations used in the activity assays, no autophosphoryU tioi 
of the protein was detected. Quantitation of the ATPase 
activity and the phosphotransferase activity indicated that 
the two reactions were tightly coupled. No ATPase activity 
was observed in the absence of a peptide substrate. 



inter- 
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Since phosphotyrosines are considered high-energy 
mediates, the tyrosine-kinase reaction should, in pr 
pie, be reversible. In fact, the enzyme catalyses ger era 
tion of ATP from ADP and phosphorylated poly( 
lp-poiy(E 4 Y)] in the presence of Mg 2+ . Although cat 
of catalyzing the reverse reaction, VEGFR2A50 clearly 
exhibits a commitment to forward reaction as evidenced 
by higher * cat for the forward direction relative to that for 
the reverse'direction: 13.0 and 0.13 s" 1 , respectively for 
VEGFR2A50P (Table 1). 

It is interesting to note that for both nonphosphoryl ited 
and phosphorylated VEGFR2A50 the J£ M{Mft M>P> is !css 
than A' M , Mf ,. vrP) and K m ^ ]yo]yEAY) is less than K m<po k 
However, the relative differences in these K m value? are 
much smaller than the relative increase in values 
between the forward and reverse reactions such that the 
Jt JK^ values are larger in the forward direction (Tabl ; 1). 

Comparison of the kinetic constants for VEGFR2A50 vith 
those for a VEGFR2 kinase domain containing the entire 
KID (cchVEGFR2) [32] shows that the 50-residue (ID 
deletion in VEGFR2A50 does not have a signifij^ant 
impact on in vitro catalytic activity. For example, > ca 
VEGFR2A50P and phosphorylated cchVEGFR2 in 
forward direction were 13 and 12 s'\ respectively. Simi- 
larly K M(M ^ T[ > } for VEGFR2A50P and phosphoryhted 
cchVEGFR2 were 150 and 153 mM, respectively. 

Structure determination 

The VEGFR2 KID sequence is hydrophilic and hi ghly 
charged, containing six lysine, five arginine, eight gluts .mic 
acid, and flveaspartic acid residues (Figure 1). Initially, 
several protein constructs containing the VEGFR2 cat- 
alytic domain with the entire KID were generated (unpuh- 
lished results). After exhaustive attempts to crystallize 
these protein constructs failed to yield even marginal < rys 
tals, the VEGFR2A50 construct was created to test * u - 
idea that the highly charged KID was interfering 



for 
the 



the 
vith 



crystallization. As determined by dynamic light scattc ring 
this VEGFR2A50 construct, which eliminated 14 cha ged 
residues, exhibited markedly more stability and less inul- 
timeric aggregation at various temperatures and prcjtem 
concentrations than protein constructs containing the entire 
KID (data not shown). 

For crystallization, purified VEGFR2A50 was autophosj>ho- 
rylated in vitro by incubation with MgATP. Matrix-assisted 
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Figure 1 



Sequence alignment of the kinase domain of 
VEGFR2 and other RPTKs. Secondary 
structural elements of VEGFR2A50P are 
indicated: a helices are designated as ccC-al, 
p strands are designated as p1 -p8, and loops 
are indicated by curves. The site of the 50- 
residue deletion in VEGFR2A50 is indicated 
by |. The site of the E990 V mutation in 
VEGFR2A50 is denoted by an *. Sequences 
are from VEGFR2 (reported here); FGFR1 
(SWISSPROT accession number P1 1362); 
IRK (EMBL database number A1 8657; 
numbering as in [30]); VEGFR1 
(SWISSPROT number P17948); PDGFRa 
(SWISSPROT number Pi 7948). 
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Nucteotide-binding 

806 MDPDELPLDEHCE^LPYDASKVffiFPRDRLKLGKPLGRGAFGQVIEADAFGIDKTATCR-- 863 

456 — mlagvsey--fxp~edprwelprdrlvlckpu:egcfgqvvlaeaigldkdkphrvt 309 

9 ? 8 VFPCSVyVPDEWEVSREKTTLLRELGOGSFGMVYEGNARDIIKGEAE--r 1025 

799 HDPDEVPLDEOCERLPYOASKVJEFARERLKLGKSLGRGAFGKWQASAFGIKKSPTCR-- 856 

576 DPMQLPYD-SRWEFPRDGLVLGRVLGSGAFGKWEGTAYGLSRSQPW-- 622 

864 1VAVKMLKEGATHSEHRAI^SELKILIHICHHLHVVWIXGACTKPGGPIJ«VIVEFCKFGN 923 

510 KVAVKXLKSDATT^DL5DLISEMEMMKMIG}OC<NI iriLLGACT-QDGPLYVIv^EYASKGN 568 

102S RVAVKrVMESASLR£RIEri^EASV>IKGrTCH-HWRLl^WSK-G^PTL\A^ELMAHGO 1083 

B 5 7 7VAVKKLKEGATASEYKALMTELK I LTH IGHHLNWNLLCACTKQGGPLMVI VEYCKYGN 916 

623 KVAVKXLKPTARSSEKQA^MSELKIMTHLGPHLNIVSLLGACTK-SGPIYIITEYCFYGD 681 

otD- Kinase insert domain 

924 LSTYLRSKRNEFVPYiJtKGARFROGKDYVG 953 

569 LREYLOARRPPGLEYCYN 586 

1084 MCSYLRSLRPEA 1095 

917 LSNYLKSKRDLFFLNKDAALHMEPKKEKME 946 

682 LVNYL:iKNRDSFLSHHPEKPKKELDIFGIJJPADESTRSYVrLSFEKNGDY>mMKOADTT>g 741 
OLE 

954 ---ai^dlkrrldsitssqssassgfveekslsdveeeIeapedlykdfltlehlicysf 1010 

S87 PSHNPEEOLSSKDLVSCAY 605 j 

1096 - SMNPGRPPPTLQEMIQHAA 1114 

947 --PGLliOGKKPRLDSVTSSESFASSGFQSDKSLSDVEESEDSDGFYKEPITKEDLISYSF 1004 

742 YVPHL^.RKEVSKYSDTQRSLYDRPASYKK- KSMLDSEVKNLLSDDNSEGLTLLDLLSFTY 800 

^^^^atalytic loo P^ JS7^ Activation loop _ 

1011 QVAKGKEFIASRKCIHRDLAARKILLSEKNVVKICDFGLARDIYKDPDYVRKGDARLPLK 1070 

606 QVAJtGltEYLASKKCIHRDLAARKVLVTEDNVHKIADFGI-ARDIHHIDYYKKTTNGRLPVK 665 

1115 EIADGHAY LKAKKFVHRDLAARKCHVAHDFTVK I GDFGMTRDI Y ETDYYRKGGKGIXPVR 1174 

1005 QVAJ^GI-tEFLSSRKCIHHDLAARKl LLSE^WVVKICDFGLARDI YKNPDYVRKGDTRIrPLK 1064 

801 QVARGIlEFLASKNCVHRDLiAARNVLIJVQGKI\raiCDFGLARDIMHDSWYVSKGSTFbP\n; 860 

— ^ aEF aF aG 

1071 WMAPET I FDRVYTI QSDVWSFGVLLWEI FS LGASPY PCVK I DEEFCRRLKEGTRMRAPDY 1130 

666 WMAPK\liFDRIYTHQSDVWSFGVLLWEIFTLGGSPYPGVPVEELF-KL.l,KEGHRMDXPSrc 724 

1x75 VMAPE:>LKDGVFTTSSD!MIVSFGVVLWEITSIjAEQPYQGLiSKEOVL- KFVMDGGYLDQPDIi 1233 

1065 WMAPK5 IFDKIYSTKSDVWSYGVLLWEIFSLGGSPYPGVQHDEDFCSRltREGMRMRAPEY 1124 

861 WMAPE5 I FDN1.YTTLSDVWS YG I L.LWE I FSLGGTPY PGMMVDSTFYNXI K SGYRMAKPDH 920 
«XH ttl 

1131 TTPEMY QTMLDOtfHGEPSQRprFSELVEHLGNL^OANAQQD 1171 

725 CTNEl/fMKMRDCWHAVPSOKPTFKQLVEDLDRIVALTSSQE 765 

1234 C PERVTDLMRKCWQFN PNMRPT FLE I VKLLKDHuH PS FPEV 1274 

112G STPEIYQIMLDa-nrRDPKERPRFAELVEKLGDLLQAN\'QQD 1160 

921 ATSE^fEIHVKCWNSEPEKRPSFYHLSEIVENLLPGQYKKS 961 

Structure 



laser dcsorption ionization (MALDI) and nanoelectro- 
spray ionization (NanoESI) mass spectrometry analysis of 
full-length phosphorylated VEGFR2A50 (VEGFR2A50P) 
protein and tryptic-digestion peptides indicates phosphory- 
lation of Y1059. Crystals diffracting to 2.2 A were obtained 
of VEGFR2A50P in an unliganded state. The crystals 
belong to the orthorhombic space group P2 1 2,2, 1 with one 
VEGFR2A50P molecule in the asymmetric unit. 

To check that the protein had not dephosphorylatcd prior to 
or during the crystallization process an anti-phosphotyrosinc 
western blot analysis of dissolved crystals was performed 
(Figure 2). In this analysis samples were run of native non- 
phosphorylated VEGFR2A50, VEGFR2A50P taken at two 
time points during rhe a i ^phosphorylation reaction, and 
dissolved crystals. As Figure 2 shows, nonphosphorylated 



VEGFR2A50 did not appreciably bind to the anti-phos- 
photyrosine antibodies, whereas the autophosphorylated 
VEGFR2A50P and dissolved crystals showed a strong inter- 
action. That the crystals contain phosphorylated protein is 
also implied by our observation that nonphosphorylated 
VEGFR2A50 did not crystallize under the crystallization 
conditions reported here for VEGFR2A50P. 

Initial crystallographic phases were determined by mol- 
ecular replacement using the structure of the unphospho- 
rylated kinase domain of FGFR1 [30] as a search model. 
The correctness of the molecular replacement solution 
was cross-checked by calculating a difference Fourier with 
a gold cyanide derivative. The derivative data, however, 
were nor included for phase calculations of electron- 
density maps used to build the model. The structure has 
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Table 1 



Kinetic constants of VEGFR2A50 






Native VEGFR2A50 




Phosphorylated VEGFR2A50 


Substrate 


K M (mM) 


Us") 




K M (mM) 






Forward reaction 
MgATP 
poly(E 4 Y) 
Mg 2 + 

Reverse reaction 

MgADP 

p-poly(E 4 Y) 


0.597 

9.4 

3t.O 

0.127 
1.3 


5.5 
0.1 


9.2 x10 3 
5.Bx 10 2 
1.8x10 a 

7.9x10 a 
7.7 x10 1 


0.153 

2.1 

6.8 

0.056 
1.0 


13.3 
0.13 


87x10 3 
63x102 
20 x 10 2 

23x10 2 
13X10 1 



been refined to an R factor of 20.2% for A data 

(F 0 >2o). VEGFR2A50P residues for which backbone 
atoms were not modeled due to disorder include the N-ter- 
minai residues 806-819, residues 990-997 of the KID, 
residues 1048-1063 of the activation loop, and residues 
1 169-11 71 of the C terminus. Structure-determination sta- 
tistics are included in Table 2. 

Overall kinase fold 

Analogous to previously reported structures of both serine/ 
threonine and tyrosine protein kinases, VEGFR2A50P is 
folded into two lobes, with catalysis of phosphotransfer 
taking place in a cleft between the two lobes (reviewed in 
[27,33J). A Ca trace of the VEGFR2A50P structure is shown 
in Figure 3a. Kinase secondary structural elements are desig- 
nated (Figure 1 ) according to the convention originally given 
for cAPK (26], The N-terminal lobe (approximately residues 
820-920) folds into a twisted 0 sheet with one a helix (aC). 
The (3 structure comprises five antiparailel strands (01-05), 
three of which (01-03) are highly curved and curl over the 
other two strands ((34-05). The larger C- terminal domain 
(approximately residues 921-1168) contains two antiparailel 
0 strands (07-08), which lie at the top of the C-terminal 
domain adjacent to the N-terminal 0 sheet. Seven a 
helices (ocD, aE, aE-F, aF, aG, aH, al) form the remain- 
ing core of the C-terminal domain. Like other kinases, 
VEGFR2A50P also contains functionally important loop 



Figure 2 




2 4 6 1° 



Phosphorylation state of VEGFR2A50 crystals. A cross section of an 
antiphosphotyrosine western blot is shown. Regions of the blot not 
shown did not contain visible bands. Lane 2, purified VEGFR2A50; 
Lane 4, purified VEGFR2A50 after 20 min of autophosphorylation; 
Lane 6^ purified VEGFR2A50 after 40 min of autophosphorylation; 
Lane 1 0, dissolved crystals. 



regions: the glycine-rich nucleotide binding loop (residues 
841_^) i the catalytic loop (residues 1026-1033) and the 
activation loop (residues 1046-1075; Figures 1 and 3a). 

Of the reported kinase structures, the VEGFR2A50P struc- 
ture resembles most closely that of the catalytic domain 
of FGFRl [30] with which it shares approximately 55% 
sequence identity (Figure 1). As the two molecules in the 
crystallographic asymmetric unit of the FGFR1 structure 
solution are very similar, comparisons to VEGFR2A50P 
will primarily be described only for the FGFRl molecule 
A. Least-squares superposition of 82 Cot positions of strands 
(01-05) of the N-terminal lobe or 152 Ca positions of 
helices (aD, aE, aF, aG, aH, al) of the C-terminal lobe 
between FGFRl and VEGFR2A50P result in respective 
root mean square deviation (rmsd) values of 0.40 A and 
0.52 A. A relative rotation of approximately 5 C between the 
two lobes results in the interlobe cleft of VEGFR2A50P 
being slightly larger and more open. Measurement of dis- 
tances between equivalent Ca atoms (K523 and R675 of 
FGFRl, S877 and R1080 of VEGFR2A50P) at the ends of 
the cleft reveal that this distance is 25.3 A in VEGFR2A50P 
in comparison with 23.2 A in FGFRl. This is, however, a 
minor difference in comparison with much larger relative 
lobe rotations observed among kinase structures in various 
complexes and phosphorylation states [27]. For example, 
the interlobe orientation seen here for VEGFR2A50P is in 
an approximately 20° more open conformation than that 
seen in the ternary complex structure of the phosphory- 
lated kinase domain of IRK bound to the ATP analog, 
adcnylyl imidodiphosphatc (AMP-PNP)and a peptide sub- 
strate [29] (Figure 3c). 

Although the 0-strand positions of the N-terminal lobe 
agree well between VEGFR2A50P and FGFRl, the struc- 
tures do diverge significantly at the N-terminal residues 
preceeding the first conserved region, which starts at residue 
W827 (Figure 3a and 3b). The first 14 residues (M806- 
E819) of VEGFR2A50P are completely disordered and the 
next seven residues (L820-R826) form an extended-loop 
structure. It is likely that residues 806-819 do not form 
part of the active kinase region but arc instead part of, or 
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Table 2 



VEGFR2A50P structure determination statistics 



Data set 


Native (3) 


Native (1) 


Native (2) 


KAu(CN) a 


Data resolution (A) 


15-2,2 


20-2.9 


15-2.4 


15-3.1 




5.2(19.6) 


8.4 (26.6) 


7.0 (21 .9) 


7.1 (19.5) 


Completeness (%) 


93.0 (81.0) 


97.5 (98.4) 


98.8 (98.8) 


96.5 (95.0) 


Temperature (°C) 


-186 


room (-21) 


4 


4 


Unit cell a (A) 


95.41 


97.10 


98.52 


97.71 


Unit cell b (A) 


96.04 


96.94 


96.50 


96.97 


Unit cell c (A) 


38.22 


38.63 


38.56 


38.52 


Refinement resolution (A) 


8-2.4 








Refined R*§ (%) 


20.2 








Rfree* {%) 


27.3 









*R 8ym = 1 00 x I^IJ/, (hkl)-</(hkl)>| / E^I/i (hkl). 'Value in 
parentheses is for highest resolution shell: Native (3), 2.28-2.20; 
Native (1), 3.02-2.90; Native (2), 2.49-2.40; KAu(CN) 2l 3.21-3.10. 
*R = 1 00 x ZJ |F 6 <hkl)|-|F c (hkl)| j /Zyu |F C (hkl)| where F 0 and F c are 
the observed and calculated structure factors, respectively (F c > 2o). 



§Model includes 275 protein residues and 1 82 water molecules. 
11 Calculated using a test set of 7% of the Native (3) data and 
subjecting the final refined model to simulated annealing and 
positional refinement with this test set not included in the 
minimization calculations. 



are adjacent to, the juxtamcmbrane region of VEGFR2. 
Residues 820-826 do seem to be part of the kinase domain 
as analogous residues are also ordered in the structures of 
FGFRl, IRK, and the non-receptor tyrosine kinase Lck 
[34]. Other differences between the VEGFR2A50P struc- 
ture and other kinase structures occur at the KID and the 
activation loop (discussed below). 

Catalytic loop and ATP-blndlng site 

In protein kinases, the loop between aE and (37 has been 
termed the catalytic loop as it contains an invariant aspartic 
acid residue (D1028) that is essential for catalysis of the 
phosphotransfcr reaction [27J. This aspartic acid is part of a 



stretch of residues (H1026-N1033) whose sequence HRD- 
LAARN (single-letter amino acid code) is highly conserved 
among protein tyrosine kinases. In VEGFR2A50P the back- 
bone position and most sidechain positions of this loop are 
similar to those in the unliganded FGFR1 and ternary 
phosphorylated IRK (IRKP) complex structures. As seen in 
these previous structures, the sidechain carboxylate of the 
catalytic loop aspartic acid (D1028) is hydrogen-bonded 
to the sidechains of the conserved arginine (R1032) and 
asparagine (N1033; Figure 4). 

The ATP-binding site of protein kinases lies at the cleft 
between the N- and C-terminal lobes (Figure 3c). For 



Figure 3 



Overall fold of VEGFR2A50P, FGFR1, and 
IRKP. Backbone representation of structures 
of the kinase domains of (a) VEGFR2 
(VEGFR2A50P), (b) FGFR1 (130), molecule 
A of Protein Data Bank [PDBJ entry 1 FGK), 
and (c) IRKP ([29], PDB entry 1IR3). The 
views shown in (a), (b), and (c) are identical 
views generated from superpositions of the 
C-terminal domains. The positions of the 
termini are denoted by N and C. In (a) j$ 
strands are designated as fi and a helices 
are indicated by a The nucleotide-binding 
loop (orange), KID (purple), and activation 
loop (yellow) are highlighted. In (c) the bound 
AMP-PNP is shown in green and the peptide 
substrate is shown in red. The figure was 
prepared with the program INSIGHT II [53]. 
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Figure 4 




Catalytic site of VEGFR2A50P and IRKP. 
Cross section of the catalytic site of (a) 
VEGFR2A50P and (b) IRKP [29] structures. 
Protein Cot ribbon representations are shown in 
light blue. In (b) the Ca ribbon representation 
of the bound peptide substrate is shown in 
orange. Specific atoms are colored by element 
type: carbon (green), oxygen (red), nitrogen 
(blue), phosphorous (purple) and magnesium 
ions (yellow), (a) includes only protein atoms, 
(b) includes protein atoms, peptide atoms, 
AMP-PNP atoms, and Mg 2+ ions. The figure 
was generated using INSIGHT II [53]. 



VEGFR2A50P, the residues forming this site consist pri- 
marily of residues E917-N923 that join the two lobes, and 
residues L840-1849 that include parts of |M, 02 and the 
glycine-rich loop of G841-G846. The glycine-rich loop, 
also referred to as the nucleotide binding loop, is a flexible 
segment whose position differs among kinase structures in 
various activated and liganded states. In VEGFR2A50P 
this loop is fairly well ordered and all atoms could be 
modeled, with the exception of the sidechains of R842 
and F845. The relative position and conformation of this 
loop is similar to that observed in the unliganded FGFR1 
structure. However, this position is markedly different from 
that in the IRKP ternary complex structure in which the 
approximately 20' relative rotation of the N- and C-termi- 
nal lobes results in the glycine-rich loop being 5 A closer 
to the C-terminal lobe than in VEGFR2A50P. 

In reported kinase structures with bound ATP or an ATP 
analog, the adenine ring makes two conserved hydrogen 
bonds with the protein backbone. In the structure of 
FGFR1 with adenylyl methylenediphosphate (AMP-PCP) 
bound [30], these hvdrogen bonds are between the adenine 
NH 2 and the backbone C-O of E562 (E9l 7 VEGFR2A50P) 
and between the adenine Nl and the backbone NH of 



A546 (C919 VEGFR2A50P). Although the structure pre- 
sented here docs not contain a bound nucleotide, the sim- 
ilarities in the positions of these backbone atoms to those 
in FGFR1 indicate that these hydrogen bonds would be 
formed in a VEGFR2A50P-ATP complex and therefore the 
adenine is expected to bind in a similar position (Figure 5). 

Variation in the ATP-binding sites of kinases involved 
in disease is of considerable importance in the design of 
selective ATP-competitive inhibitors as therapeutics. A 
comparison of the ATP-binding sites of FGFR1 and 
VEGFR2A50P reveals that while the overall architecture 
of the site is conserved, several sequence differences result 
in differences in the shape of the accessible area for ligand 
binding. Specific sequence differences between FGFR1 
and VEGFR2 in this site include V899 (1545 FGFR1), 
F918 (Y563 FGFR1), C9I9 (ASM FGFR1), and CI 045 
(A640 FGFR1; Figure 5). Similarly, comparison to the 
ternary IRKP complex, structure reveals variation in the 
adenine site at V916 (V11076 IRK), F918 (L1078), C919 
(M1079 IRK), L1035 (Ml 139 IRK), and G1045 (G1149 
IRK). Even greater sequence and structural variation in the 
adenine site is seen when the VEGFR2A50P structure is 
compared to serine/thrconine-kinase structures, suggesting 



Figure 5 



VEGFR2A50P 
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created with the pro-am Xfit [50]. 
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Figure 6 



Electron-density map of the kinase insert 
domain area of VEGFR2A50P. Stereoview of 
a 2F 0 -F C map computed at 2.4 A and 
contoured at 1 .2o\ and superimposed with 
the refined model. Carbon atoms are yellow, 
oxygen atoms red, and nitrogen atoms are 
blue. Water molecules are depicted as red 
crosses. The figure was created with Xfit [50]. 



that these differences could be exploited in the design of 
selective ATP-competitive inhibitors. 

Activation loop 

Protein kinases concain a large flexible loop, referred to as 
the activation loop (A loop), whose conformation is postu- 
lated to regulate kinase activity (Figure 3). In many kinases 
the conformation of the A loop is controlled by the phos- 
phorylation of specific A-loop residues [27]. The loop can 
he generally defined as beginning with the conserved 
residues DFG and ending at the conserved APE sequence 
[27], In VEGFR2 this segment corresponds to D1046- 
E1075 and contains two tyrosines (Y1054 and Y1059). Both 
Y1054 and Y1059 were found to be autophosphorylation 
sites when the cytosolic domain of VEGFR2 was expressed 
in Escherichia coli [25]. Using the in vitro autophosphoryla- 
tion protocol described here for VEGFR2A50, a stable 
phosphorylation site is indicated at Y1059; however, no evi- 
dence of phosphorylation of Y1054 was detected. 

In the VEGFR2A50P structure presented here, the A loop 
appears quite mobile and interprctable electron density 
was not present for most of the central portion of the loop 
(G1048-G1063). This disorder is consistent with mohility 
of the A loop deduced from other kinase structures. For 
example, of the two molecules in the asymmetric unit of 
the unphosphoryl a ted FGFRl kinase structure, the center 
of the A loop has relatively high temperature factors in 
molecule A and is completely disordered in molecule B. 
Although residues 1048-1063 could not be modeled in 
VEGFR2A50P, unambiguous electron density was present 
for residues D1064-E1075, clearly indicating that these 
residues adopt a conformation similar to that observed in 
the unphosphorylatcd FGFRl structure. The segment of 
D1064-P1068 has an extended structure that lies adjacent 
to the catalytic residues D1028 and R1032 (Figure 4a). 
Comparison with the structure of the MgAMP-PNP- 
pcptidc-IRKP complex indicates that the position of 
RI066-P1068 in this VEGFR2A50P structure is inhibitory 
to substrate binding. P1066 occupies the equivalent space 



allocated to the tyrosine sidechain of the peptide substrate 
in the ternary IRK3P complex structure. The conformation 
of residues L1069-E1075 is similar to that in the ternary 
[RKP complex structure. However, there is a complete 
directional change at PI 068 (P1172 IRK) between the two 
structures. In the IRK structure, residues N-terminal to 
this proline arc directed toward aEF, whereas in 
VEGFR2A50P they are directed toward aD on the oppo- 
site side of the protein (Figures 3 and 4). 

It is somewhat surprising that despite the phosphorylation 
of Y1059 prior to crystallization, the conformation seen here 
for residues D1064-P1068 is similar to the inhibitory' con- 
formation observed for analogous residues in the unphos- 
phorylated FGFRl structure. Y1059 in VEGFR2AS0 corre- 
sponds to a relatively conserved phosphorylation site among 
protein tyrosine kinases. In the ternary IRKP complex 
structure and the phosphorylated Lck structure [341, the 
tyrosine at this position (Y1163 IRK, Y394 Lck) is phos- 
phorylated and the A loop has a non-inhibitory conforma- 
tion similar to that observed in a phosphorylated cAPK 
ternary complex structure [35]. The interactions the phos- 
phate group at this position makes with other protein 
residues arc believed to help stabilize an A-Ioop confor- 
mation that allows substrate and ATP binding [27,29]. 
However, as the VEGFR2A.S0P structure described here 
does not exhibit a similar open A-loop conformation but 
rather has an inhibitory conformation with much of the 
loop disordered, it is possible that the monophosphory- 
lated A loop of VEGFR2A50P exists in a dynamic equilib- 
rium involving several conformations and that the confor- 
mation observed here is the one most favored in this 
crystal environment. 

Kinase insert domain 

The kinase insert domain occurs in the kinase: C- terminal 
lobe and connects helices aD and aE. In VEGFR2 this 
region corresponds to a 68-residue srretch from N933 to 
LI 000 (Figure 1). The lack of effect on nnrinsic kinase 
activity (noted above) of deletion of residues T940-E989 
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is perhaps not surprising, as the ends of the KID domain 
are relatively far away (approximately 35-40 A) from the 
catalytic site and arc on the opposite side of the protein 
from the position of the activation loop (Figure 3). These 
results agree with those for the CSF-1R kinase in which 
deletion of 58 of the 64 residues of the CSF-1R KID 
only decreased its ability to phosphorylate a peptide sub- 
strate by 10% [18]. Deletion of the entire 98 KID residues 
of PPDGFR, however, resulted in an 80% decrease in 
kinase activity towards a peptide substrate [36]. Consid- 
ered together, these results suggest that the majority of 
the KID is not required for catalysis but that a small number 
of residues must be present to form a linker between aD 
and aE to maintain a competent kinase structure. 

In the VEGFR2A50P structure, following aD, residues 
N933-P937 form a loose turn and an extended strand 
whose ends arc roughly perpendicular to the axes of aD 
and al at the C terminus. In difference Fourier maps the 
electron density is strong and clear for residues N933-P937 
and becomes weak for Y938 and K939 (sidechains of Y938 
and K939 are not modeled; Figure 6). The 50-residue 
deletion in VEGFR2A50 directly follows K939 so that the 
residue immediately C-terminal to K939 is V990, main- 
taining the residue numbering in full-length VEGFR2. 
Residues V990-K997 are disordered and interpretable 
electron density begins again at D998. Residues D998- 
T1001 then form a short strand that joins aE at residue 
L1002 (Figures 6 and 7). 

An interesting feature of these two strands at the N- and 
C-terminal ends of the KID is that they form a pseudo 
two-stranded parallel p-sheet structure that is quite differ- 
ent from the conformations seen in this region of other 
kinase structures. The two ends of the KID thus make a 
variety of interactions that may help to stabilize the overall 
conformation and position of this domain in VECJFR2. The 
sidechain of K931 makes an ionic interaction with the 
sidechain of E934 and also makes a hydrogen bond with the 
backbone carbonyl of D998 (Figure 7). Hydrogen-bond.ng 



interactions between the strands include E934 backbone 
G=0 to L1000 NH, V936 NH to L1000 C=0, and P937 
G=0 to L1002 NH. In addition to these polar interactions, 
the sidechains of F935, P937, and L1000 arc involved in 
extensive hydrophobic contacts. The sidechain of F935 is 
nestled in a hydrophobic pocket formed by the sidechains 
of L928, P937, LI 000, L1002, L1005, LI 101, and Y1130 
(Figures 6 and 7). The L1000 sidechain also packs against 
the sidechains of Y927, K931, H1004, and Y1008. 

The length, composition, and structure of the connecting 
region between aD and aE varies considerably among 
kinases. Foi brevity, only comparisons to the IRK and 
FGFRl receptor tyrosine kinase structures will be described 
here. In IRK this connection stretches twelve residues, 
of which five are proline, from P1093 to PI 104. This 
proline-rich segment is ordered and exhibits similar loop- 
like conformations in the unphosphorylated apoenzyme 
structure [28] and in the phosphorylated ternary complex 
structure [29|. The most significant divergence in the KID 
position in IRK and VEGFR2A50P begins immediately fol- 
lowing aD at N'933 (P1093 IRK). In IRK, residues P1093- 
G1100 form a loop that folds towards the N-tcrminal domain. 
By contrast, the path taken by N933-K939 in VEGFR2A50P 
is towards the C-terminal helix al (Figure 3a). 

In FGFR1 the connection between aD and aE is 18 
residues long (P578-L595) and shares little sequence 
homology with VEGFRZ or IRK. In FGFRl this region 
is apparently quite mobile because, for the two mol- 
ecules in the crystallographic asymmccric unit of the 
FGFRl structure, residues 580-591 are disordered in 
molecule A and residues 578-593 are disordered in mol- 
ecule B [30]. Although most of this segment is disor- 
dered in FGFRl, the ordered ends follow a path more 
similar to IRK than to VEGFR2A50P. 

Sequence comparisons with other members of the PDGFR 
family suggest that some of these kinases may have confor- 
mations for the beginning and end residues of the KID 



Figure 7 




Stereo cross section showing the ordered 
residues of the KID of VEGFR2A50P. 
Carbon atoms are yellow, oxygen atoms are 
red, nitrogen atoms are blue, and sulfur 
atoms are green. The view is rotated roughly 
1 80° from that of Figure 6. The figure was 
created with Xfit [50]. 
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similar to those seen here for VEGFR2A50F. In particular, 
there is significant sequence homology in the first four 
KID residues (N933-V936) that make the turn towards the 
C terminus. Of these residues, a phenylanine at the posi- 
tion of F935, which makes extensive hydrophobic interac- 
tions, may be particularly important for structural integrity 
as a phenylalanine occurs ac this position in sequence align- 
ments with VEGFR1, VEGFR3, aPDGFR, pPDGFR, and 
ckit [37-40]. 

Biological implications 

The crystal structure and kinetic characterization of a 
protein construct (VEGFR2A50) of vascular endothelial 
growth factor receptor 2 (VEGFR2), containing the 
kinase core and 18 of the 68 residues of the kinase insert 
domain (KID), provide several biological insights. The 
high similarity in overall fold and catalytic-residue posi- 
tions to the fibroblast growth factor receptor 1 (FGFR1) 
and insulin receptor kinase (IRK) kinase domain struc- 
tures suggests that the catalytic mechanism of VEGFR2, 
and other platelet-derived growth factor receptor PDGFR- 
family members, is consistent with other tyrosine kinases. 
Tyrl059, of the VEGFR2 kinase activation loop, was 
confirmed to be an autophosphorylation site. The activa- 
tion loop is predicted to exist in a dynamic equilibrium of 
several conformations, one of which is similar to a con- 
formation observed in FGFR1 and is inhibitory to sub- 
strate binding. 

As deletion of the 50 central residues of the KID did not 
affect autophosphorylation or kinase activity on an exoge- 
nous substrate, this domain does not seem to have a 
direct function in the phosphotransfer reaction. Although 
not directly involved in catalysis, evidence suggests that 
the KID functions in signal transduction as a binding 
site for cellular proteins. The VEGFR2 KID contains 
three tyrosines (Y938, Y951. and Y996) of which Y951 
and Y996 have previously been shown to be autophos- 
phorylation sites. The determination of these autophos- 
phorylation sites as binding sites for other proteins has 
yet to be reported for VEGFR2; however, phosphatidyl 
inositol 3 kinase, Grb2, and SHP-2 have been to shown 
to bind to phosphotyrosines in the KID of other PDGFR 
family members [411. 

In the structure reported here, the residues forming the 
two ends of the VEGFR2 KID interact with each other to 
form a pseudo-p substructure not observed in previous 
protein kinase structures. This substructure is stabilized 
by a large number of interactions and is expected to also 
occur when the entire KID is present. The close proximity 
of this pseudo-p substructure to the post-kin ase domain 
implies that the KID and post-kinase domains pack near 
to each other in full-length VEGFR2 and perhaps m other 
PDGFR family members. In this case, the pseudo-(i struc- 
tural element may serve to orient the KID to pack near to 



the post-kinase domain. As both the KID and the post- 
kinase domains of these receptor protein tyrosine kinases 
(RPTKs) are believed to be involved in the binding of 
other proteins, it is possible that having the KID and post- 
kinase domains in near proximity may facilitate formation 
of multimeric complexes for signal transduction. 

Materials and methods 

Cloning 

The coding sequence [42] (or the cytoplasmic domain of the VEGFR2 
was amplified by the polymerase chain reaction (PCR) [43] from a 
human aorta cDNA pool (Qontech, Palo Alto, CA). Two overlapping 
sequences were amplified independently: Vcyt (residues M806-V1 356), 
which represented the entire cytoplasmic domain, and Vcat (residues 
C817M-G1 191), with boundaries based upon a primary amino acid 
sequence alignment with the insulin receptor kinase catalytic domain 
[44]. The PCR oligonucleotide primer sequences for Vcyt were; Vcyt5: 
5'-CAGCATATGGAT(XAGATGAACTCCCATTGG-3' and Vcyt: 5'- 
GCGGTCGACTTAAACAGGAGGAGAGCTCAGTGTG-3'. The PCR 
oligonucleotide primer sequences for the Vcat were; Vcat 5 5'-GCA- 
C AT ATGG AACG ACTGCCTTATG ATGCC AGC-3' and Vcat3: 5'-CC- 
TGTCGACTTATCCAGAATCCTCTTCCATGCTCAAAG-3'. The amplified 
DNA was digested with the restriction enzymes Nde\ and Safl, ligated 
into the £ coii plasmid pET24a (Novagen Madison, Wl), and the 
sequence verified. When compared to the original sequence in 
GENBANK, accession number 346345, two nucleotide differences 
were noted that resulted in amino acid changes (Glu848-*Val and 
Asn835->Lys) in both Vcyt and Vcat. Our sequence agrees with 
subsequent VEGFR2 GENBANK submissions (accession numbers 
265541 2 and 3132833). 

Mutations were introduced by oligonucleotide site-directed mutagene- 
sis [45] using the Muta-Gene in vitro Mutagenesis Kit from Bio-Rad 
I Hercules, CA). The Vcat DNA fragment was subcloned from the 
pET24a vector using an A/del-X/iol digest into the vector pMGH4 
[46,47) and this vector was used to generate the single-stranded 
DNA (ssDNA) uracil-containing template (minus strand) in £ co/t 
strain CJ236 supplied in the kit. An oligonucleotide (5'-GCTCAGCA- 
GGATTGATAAGACTACATTGTTC-3') was designed to create a con- 
struct, Vcat(AGl 1 72-G1 191), which truncated the C terminus to 
residue D1 1 71 . Another oligonucleotide (5'-GAATTTGTCCCCTACAA- 
GGAAGCTCCTGAAGATCTG-3') was designed to delete the central 
50 residues (residues T940-E989) of the KID t on the basis of a 
sequence alignment with FGFR1 [30]. Sequence analysis detected an 
inadvertent Glu990-*Val mutation. All DNA modification and restric- 
tion enzymes were purchased from New England Biolabs and oligonu- 
cleotides were purchased from Genosys Biotechnology. 

The DNA encoding the VEGFR2A50 protein construct was made in 
several steps to combine the necessary mutations into the baculovirus 
expression vector pAcSG2 (Pharmingen. San Diego, CA). Step 1 : the 
coding region for Vcyt was subcloned from the pET24a vector into the 
Nco\-Kpn\ sites of vector pAcSG2. Step 2: a 2358 base pair (bp) 
Sca\-8gfi\ DNA fragment from plasmid pMGH4-Vcat(AT940-E989, 
E990V) was ligated to a 1695 bp Bg!L\-Scd DNA fragment from 
pMGH4-Vcat(AG1 172-G1 191) creating a pMGH4-Vcat(AT94Q-E989 1 
E990V, AG1172-G1191) vector. Step 3: a 913bp SsO-Haol DNA 
fragment, a pMGH4-Vcat(AT940-E989. E990V, AG1 172-G1 191). 
was ligated to a 3290bp Eaol-BsrEII DNA fragment from pAcSG2-Vcyt 
creating pAcSG2-Vcyt(AT940-E989, E990V, AG1 172-G1 191). This 
final DNA construct was sequenced, verified through the entire coding 
region and it was confirmed to contain only these known mutations from 
the wild-type sequence (Figure 1). 

The pAcSG2-Vcyt(AT940-E989, E990V, AG1172-G1 191) DNA, 
expressing the VEGFR2A50 protein, was transacted into Spodoptera 
fwgiperda (Sf9) cells with linearized baculovirus DNA according to the 
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protocol of the manufacturer (Pharmingen, San Diego, CA). Single 
plaques were isolated from this transfection and high-titer stocks gener- 
ated. All stocks were examined by isolation of baculoviral DNA and PCR 
amplification of the insert using the potyhedrin forward and reverse 
primers (Invitrogen). Sf21 cells were infected at 1-1.5 million cells/ml at 
a multiplicity of infection of 5 for 72 h and harvested by centrifugation. 

Protein purification 

Cell pellets were lysed by d ounce homogenization and sonication in 
20 rnM Tris pH 8.0, 20 mM NaCI, 5 mM dithiothreitol (DTT), and 5% 
(v/v) glycerol. The lysate was centrifuged for 50 minutes at 35,000 rpm 
in a T145 rotor. The soluble fraction was loaded onto a 40 ml Q-30 
anion-exchange column (Pharmacia, Piscataway, NJ) and eluted with a 
20-600 mM NaCI gradient in 20 mM Tris pH 8.0, 5 mM DTT, and 5% 
(v/v) glycerol over 20 column volumes. VEGFR2A50 was pooled by 
sodium dodecyl sulfate poryacrylarnide gel electrophoresis (SDS-PAGE) 
analysis and by the presence of kinase activity as measured against 
gastrin peptide substrate (Boehringer Mannheim, Indianapolis, IN). 
Pooled material was loaded onto a 40 ml hydroxyapatite (Bio-Rad, Her- 
cules, CA) column and washed extensively with 20 mM Tris pH 8.0, 
50 mM NaCI, 5 mM DTT, and 5% glycerol. Protein was eluted using a 
500 ml linear gradient from 0-50 mM potassium phosphate pH 8.0, 
50 mM NaCI, 5 mM DTT, and 5% glycerol. VEGFR2A50 was pooled 
by SDS-PAGE analysis and by the presence of kinase activity as mea- 
sured against the gastrin peptide. Material from this column was then 
diluted 1:1 with 20 mM Tris pH 8.0, 20 mM NaCI, 5 mM DTT, and 5% 
glycerol and loaded onto an 8 ml Q-15 anion-exchange column 
(Pharmacia). Protein was eluted using a 1 80 ml linear NaCI gradient 
(20-175 mM) in 20 mM Tris pH8.0, 5 mM DTT, and 5% glycerol. 
VEGFR2A50 was pooled as described above. (NH 4 ) 3 S0 4 (4M) was 
added to the pool to a final concentration of 0.6 M and the pooled 
material was loaded onto a 1 0 ml HP-phenyl sepharose column (Phar- 
macia). VEGFR2A50 was eluted using a 200 ml linear reverse gradient 
from 0.6-0 M (NH 4 ) 2 S0 4 in 20 mM Tris and 5 mM DTT. Purified 
VEGFR2A50 was buffer exchanged into 50 mM Hepes pH 7.5, 1 0 mM 
DTT, 10% glycerol, and 25 mM NaCI over a 500 ml G-25 column (Phar- 
macia) and concentrated to 1 mg protein/ml through a 10kDa cutoff 
polysulfone membrane (Millipore, Bedford, MA). Final material was 
aliquoted and flash frozen in liquid N 2 and stored at -70°C 

Kinetic assays 

Kinetic assays of phosphorylated VEGFR2 were performed with puri- 
fied VEGFR2A50 that was autophosphorylated under the following 
conditions: protein (4 mM), ATP (3 mM), MgCI 2 (40 mM), DTT (5 mM), 
in Hepes (100 mM), 10% glycerol, pH 7.5, at 4°C for 1 h. Analysis of 
data was carried out as described in [321. 

Coupled spectrophotometry assay for the forward reaction 
Tyrosine-kinase assays were monitored using a Beckrnan DU 650 
spectrophotometer. Production of ADP was coupled to oxidation of 
NADH using phosphoenolpyruvate (PEP) through the actions of pyru- 
vate kinase (PK) and lactic dehydrogenase (LDH). The oxidation of 
NADH was monitored by following the decrease in absorbance at 
340 nm (£340 = 6.22 cm^TTiM" 1 ). Typical reaction solutions contained 
1 mM PEP, 250 mM NADH, 50 units of LDH/ml, 20 units of PK/ml, 
5 mM DTT, in 200 mM Hepes, pH 7.5, and indicated concentrations of 
poly(E 4 Y 1 ) (Sigma, St. Louis, MO), ATP and MgCI 2 . Assays were initi- 
ated with either VEGFR2A50P (40 nM) or VEGFR2A50 (70 nM). For 
nonphosphorylated enzyme, K MlMgfaP) was determined by varying the 
ATP concentration at fixed concentrations of Mg a+ and poly(E 4 Y) (60 
and 20 mM, respectively). For phosphorylated enzyme, K M(M ^ m was 
determined by varying the ATP concentration at fixed concentrations of 
Mg 2+ and poly(E 4 Y) (25 and 5 mM, respectively). KufaAytm for (ne 
nonphosphorylated enzyme was determined by varying the concentra- 
tion of poty(E 4 Y) at fixed concentrations of Mg 2+ and ATP (65 and 2 mM, 
respectively). K M (poiyE4Y> for tne phosphorylated enzyme was deter- 
mined by varying the concentration of poly(E 4 Y) at fixed concentrations 
of Mg 2+ and ATP (25 and 5 mM, respectively). For nonphosphorylated 
enzyme, K MtMg2+) was determined by varying the Mg 2+ concentration at 



fixed concentrations of ATP and poly(E 4 Y) (2 and 1 8 mM, respectively). 
For phosphorylated enzyme, rf M(Mg2+ .j was determined by varying the 
Mg 2+ concentration at fixed concentrations of ATP and poly(E 4 Y) (1 
and 5 mM, respectively). 

Coupled spectrophotometry assay for the reverse reaction 
ATP generation was coupled to production of NADH via the action of 
hexokinase (HK) and glucose-6-phosphate dehydrogenase (G6PD). In 
this assay, HK catalyzes the conversion of ATP to ADP and glucose-6- 
phosphate. Glucose-6- phosphate is then oxidized to D-6-phosphoglu- 
COnopyranose-1 ,5-lactone by G6PD, with concomitant reduction of NAD 
to NADH, which can be monitored at 340 nm. Typical assay solution 
contained glucose (1 0 mM), NAD (40 mM), DTT (5 mM). MgCl 2 (4 mM), 
HK (15 unit/ml), G6PD (15 units/ml) and indicated concentrations of 
ADP and phospho*po!y(E 4 Y). The reactions were initiated with addition 
of either VEGFR2A50P (600-900 nM) or VEGFR2A50 {950 nM). For 
the phosphorylated enzyme, ^ M(M9ftDP j was determined by varying the 
ADP concentration at fixed concentrations of Mg 2+ and p-poly(E 4 Y) (10 
and 8 mM, respectively). For the non-phosphoryiated enzyme, KfcifMgADP) 
was determined by varying the ADP concentration at fixed concentra- 
tions of Mg 2 * and p-poly(E 4 Y) (10 and 5mM, respectively). ^tp-p^Y) 
for the both the phosphorylated and non-phosphorylated enzyme was 
determined by varying the concentration of p-pofy(E 4 Y) at fixed concen- 
trations of Mg 2+ and ADP (10 and 1 mM, respectively). 

In vitro autophosphorylation for crystallization and mass 
spectrometry 

Aliquots of frozen VEGFR2A50 were thawed by immersion in cold H 2 0 
and pooled at 4°C. MgCI 2 and ATP were added to 26 mM and 4 mM, 
respectively. VEGFR2A50 was incubated at 4°C for 1 h. This material 
(VEGFR2A50P) was then buffer exchanged into a solution of 1 0 mM 
Hepes pH 7.5, 1 0 mM DTT, and 1 0 mM NaCI, and was concentrated 
using a Centriprep-1 0 (Millipore) to 5 mg protein/mi. 

Mass spectrometry: trypsin digestion 

Trypsin digestions of purified VEGFR2A50 and VEGFR2A50P were 
conducted at 37*C using 0.37 mg/ml protein in 25 mM NH 4 HC0 3 at 
pH 7.7 with a reaction volume of 100 u.1 for two days. 

Mass spectrometry: MALDI MS 

Matrix-assisted laser desorption mass spectrometry (MALDI MS) analy- 
ses were performed in a Voyager-Elite, time-of-flight mass spectrome- 
ter with delayed extraction (PerSeptive Biosystems, Inc., Framingham, 
MA). A volume of 1 uJ of digested protein sample was mixed with 1 pi 
of matrix (a-cyano-4-hydroxy-cinnamic acid) in a solution of 50% (v/v) 
acetonitriie and 0.25% (w/w) trifluoroacetic acid in water. Samples 
were irradiated with a nitrogen laser operated at 337 nm. 

Mass spectrometry: nanoESI MS 

Nanoelectrospray ionization (NanoESI) MS analyses were performed 
on a triple quadrapole mass spectrometer (PE Sciex API HI, Alberta. 
Canada) modified with a nanoESI source from Protana A/S (Denmark). 
The ESI volatage was set at 700 V and the orifice settings were main- 
tained at 100 V. A volume (3 jit) of digested protein was mixed with 
7 jjl! of methanol and 0.5 pi formic acid, and then 4 pi of this sample 
was injected into the mass spectrometer. Ion scans were used to 
obtain the sequence of phosphopeptides. 

Crystallization 

Purified phosphorylated VEGFR2A50 was concentrated on average to 
5 mg protein/ml using a Centricon-10 centrifugal concentrator. Crystals 
were grown by the hanging-drop vapor-diffusion method at 4°C. Drops 
containing 2 uJ of protein solution and 2 pi of a motheHiquor solution 
(1 00 mM Hepes pH 7.2, 2 M (NH 4 ) 2 S0 4 , and 2% (v/v) monomethylether 
polyethylene glycol, molecular weight (MW)=550kDa) were equili- 
brated above a 1 ml reservoir of the mother-liquor solution to which 
50 mM p-mercaptoethanol had been added. Oystals appeared after 
3-4 days and grew to as large as 0.3 x 0.2 x 0.5 mm over 21 days. 
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Western-blot analysis of crystals 

Crystals were removed from the crystallization drops with fiber loops, 
washed four times in 5 |il of mother-liquor solution* and dissolved in 25 u I 
of a solution containing 50 mM Hepes pH 7.5 and 1 0% DMSO. The 
crystallization sample was run on an SDS polyacrylamtde gel along with 
control samples of approximately 0.1 5 pg of purified non phosphorated 
VEGFR2A50 and VEGFR2 A50 after 20 min and 40 min of the autophos- 
phorylation reaction. Kinase activity in all samples was stopped by bring- 
ing the final solution to 25 mM Hepes pH6.0, 40 mM EDTA pH8.0, 
10% glycerol, and 40 mM NaCl. The western blot was performed using 
the ECL blotting kit from Amersham Inc. (Arlington Heights, IL). The 
primary antibody solution was a 1:1000 dilution of mouse anti-phospho- 
tyrosine mAb PY20 and the secondary antibody was a 1 :1 500 dilution of 
goat anti-mouse horseradish peroxidase conjugate. Both antibodies were 
obtained from Transduction Laboratories Inc. (Lexington, KY). 

Data collection 

X-ray diffraction data sets were collected using a Rigaku RU-200 rotat- 
ing anode X-ray generator (CuKa) operated at 50 kV and 1 00 mA and 
equipped with Supper focusing mirrors and a MAR345 MAR Research 
image plate detector. Data collection on frozen crystals was performed 
by transferring a crystal into a cryoprotectant solution (100 mM Hepes 
at pH 7.2, 2.2 M (NH 4 ) 2 S0 4 , 0.6 M sucrose, 0.55 M glucose, and 2% 
(v/v) monomethylether polyethylene glycol MW = 550 kDa), flash freez- 
ing the crystal in liquid nitrogen, and then transferring the frozen crystal 
into a stream of nitrogen at -186°C Data were integrated and scaled 
using the programs DENZO and SCALEPACK (48] Data-collection 
statistics are given in Table 2. 

Structure determination and refinement 

Initial protein phases were obtained using the AMoRe molecular- 
replacement program [49], molecule 1 of the FGFR1 structure (30] 
(PDB entry tFGK) as a search probe, and the nativel data set. The 
correct solution was achieved by including the FGFR1 sidechains 
and removing mobile residues of the activation loop (640-660), the 
N terminus (464-467), a short loop (517-520), and the C termius 
(760-762) from the search model. The correct solution was the top 
peak in the rotation and translation functions and had a correlation 
coefficient of 0.31. Rigid-body refinement in AMoRe improved the 
solution to a correlation coefficient of 0.49 and an R factor of 46.3% 
in the 1 2.0-4.0 A resolution range. The correctness of this solution was 
cross-checked by calculation of a difference Fourier with a KAu(CN) 2 
derivative. This derivative was generated by soaking a crystal for 
three days in reservoir solution containing 0.5 mM KAu(CN) 2 and then 
increasing the heavy-atom concentration to 5 mM and soaking for an 
additional 64 h. Scaling of data sets, Patterson calculations, Fourier 
calculations, and the generation of phases were performed using the 
program package Xtalview [50]. 

Refinement of the model was carried out using X-PLOR version 3.1 
(51). Calculation of electron -density maps and model fitting was per- 
formed using XtalView (50). Refinement was begun using a data set 
collected at 4°C (native 2) and was completed using a data set 
(native3) collected at -186 °C. The final R factor is 20.2% for data in 
the range 8-2.4 A (F 0 >2o). The average B value for all atoms is 
31.8 A 2 for protein atoms and 42.8 A 2 for water molecules. The final 
model includes residues 820-939, 998-1047, and 1064-1168, Of 
these residues, the sidechains of K833, RB42. F845, Y938, K939, 
D998, K1023, R1027. Y1038, K1039, K1110, and E1113 could not 
be modeled beyond Cp because of a lack of interpretable density. 
Analysis of matnehain torsion angles using PROCHECK [52] shows 
that, of the 275 residues in the model, none occurs in the disallowed 
region and only four occur in the generously allowed region of a 
Ramachandran plot. Water molecules, of which there were 1 82, were 
fit to electron-density peaks that were greater than 3a and were 
located in positions for making reasonable hydrogen bonds to the 
protein or other water molecules. Superpositions of various kinase 
structures were performed using the graphics program Insight II (Mol- 
ecular Simulations Inc., San Diego, CA) [53] . 



Accession numbers 

The coordinates for VEGFR2A50P have been deposited in the Protein 
Data Bank with accession code 1vr2 and will be available one year 
after the publication date of this article. 
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